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Abstract. We investigate a network model based on an infinite regular square lattice embedded in the 
Euclidean plane where the node connection probability is given by the geometrical distance of nodes. 
We show that the degree distribution in the basic model is sharply peaked around its mean value. Since 
the model was originally developed to mimic the social network of acquaintances, to broaden the degree 
distribution we propose its generalization. We show that when heterogeneity is introduced to the model, it 
is possible to obtain fat tails of the degree distribution. Meanwhile, the small- world phenomenon present in 
the basic model is not affected. To support our claims, both analytical and numerical results are obtained. 
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1 Introduction 



Networks are powerful tools for representation of many 
diverse systems arising in physics, biology, and sociology. 
Progress in this field is rapid; good reviews of our current 
knowledge are presented in [1,2,3 . In this work we inves- 
tigate a network which is embedded in an Euclidean space 
where the probability that two nodes are connected by a 
link depends on their mutual distance. A similar model 
was first proposed by Kleinberg in [4]. Later, a model 
based on a regular underlying lattice was proposed and 
some numerical results were obtained [5]; very recently, 
this work has been generalized by introducing hidden vari- 
ables 1^. In [71IH], similar models with wiring costs de- 
pending on distances are studied; in p fT0HTT[[12[|13j . the 
interplay between geographical distance and node degree 
is investigated. 

Models mentioned above have one feature in common: 
resulting networks consist of many short links and long 
distance connections are less numerous. Notice that this 
corresponds to the picture widely accepted by sociologists 
investigating networks of acquaintances [MllTS] . Their key 
phrase "Strength of weak ties" has a straightforward in- 
terpretation here: the probability of connecting two nodes 
must decrease with the distance slow enough to enable 
multiple long links. Then the resulting network resem- 
bles the structure observed in the human society. Our 
present understanding of this phenomenon agrees with 
early mathematical insights presented in [16j where im- 
portance of multiple edge length scales was discussed. No- 
tice that also the classical model of Watts and Strogatz 
with two types of links [17\ follows a similar pattern. 
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In this paper we deal with the network model based 
on the distance dependent connectivity which was inves- 
tigated in [18] and is similar to [5]. This model was de- 
veloped to mimic the acquaintance network in a human 
society. It allows us to estimate the typical degree of sep- 
aration between distant vertices in the network — the re- 
sults show that with a proper choice of the dependence 
between the linking probability and the nodes distance, 
the network exhibits the small- world phenomenon. 

However, in the original work the degree distribution 
P{k) was not a matter of interest. In this paper we show 
that it can be approximated by a Gaussian distribution. 
This result is not surprising because the model relies on 
the edges whose presence is mutually independent in the 
same way as it is in the classical random graph of Erdos 
and Renyi. Moreover, we show that the distribution of 
P(fc) is rather narrow. By contrast, when we investigate 
the number of persons's acquaintances in a real society, 
the distribution decays slowly. This observation and the 
lack of diversity in the original model were our main moti- 
vations for the presented work. The basic "homogeneous" 
model is generalized by introducing hidden variables which 
is a common approach in various network models |191l20j , 
a similar attempt was recently presented in [5]. We in- 
vestigate the tail behavior of the degree distribution and 
show that the resulting network exhibits the small-world 
phenomenon. 



2 The basic model 

We assume that nodes of the graph form an infinite square 
lattice in the Euclidean plane with the side length of the 
elementary square equal to 1. When modeling a society. 
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each node represents one person and thus in this way we 
assume a homogeneous distribution of population. The 
probabihty that two vertices with the distance d are con- 
nected by an edge we label as Q{d). Notice that this is the 
point where we introduce homogeneity to the network: the 
probability Q{d) is the same for every pair of nodes sepa- 
rated by the distance d. The degree fc of a node is defined 
as the number of edges connected to this node. Conse- 
quently, the average node degree z := (k) is given by the 
linking probability Q{di) summing over all nodes i. When 
Q{d) changes slowly on the scale of 1, the summation can 
be replaced by an integration and thus 



E 



Q(r) dr 



that these three observations allow us to guess the precise 
form of Q{d). We merely suppose that our choice is able 
to capture basic features of the human acquaintances net- 
work. More detailed discussion on the nature of Q{d) can 
be found in [l8j . 

In addition to the clustering coefficient defined above, 
another important characteristics of random networks is 
the degree of separation (or equally the shortest path 
length). It is defined as the minimal number of vertices 
along the shortest path between two given nodes. Denot- 
ing the geometrical distance of these two nodes as ^, in the 
original paper it was shown that for Q{d) given by Eq. ([3|), 
the typical degree of separation of distant nodes is 



Q{r)2Trrdr. (1) 



^(0 « - 



lnQ(0 
In z 



(4) 



Notice that with z given, Eq. ([T]) represents a normaliza- 
tion condition for Q{d). 

For a node with the degree fc, the clustering coefh- 
cient C is defined as the ratio C := n/{^ where n is 
the number of edges between the neighbours of the given 
node. Notice that < C < 1. For a particular node 
X and a given function Q(d), the average value of n is 
{'^) = \ Q{dxi)Q{dij)Q{djx)- Here the factor \ cor- 
responds to the fact that by a plain summation over all 
i ^ j a, doublecounting occurs {i ^ j). Consequently, the 
average clustering coefficient of the network can be ap- 
proximated as 



Since in two dimensions, the typical distance I scales with 
the network size S as ^/S. Consequently, for the distance 
dependence given by Eq. ([3]), the typical topological dis- 
tance of two nodes in the network scales as Z? In S. 
For the human acquaintances network is = 6 • 10^ and 
hence I w 80 000; when a lies in the range 2.5 — 3.5, and 
z in the range 50 — 500, we obtain D in the range 3 — 10. 
In addition, by numerical integration of Eq. for the 
described parameters the mean clustering coefficient lies 
in the range 0.05 — 0.30. We can conclude that the given 
network exhibits the small-world phenomenon. 
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z(z-l) 



I! Q{dx^)Q{d^,)Q[d,x)dr,dr,. (2) 3 The degree distribution 



Here we again assumed that Q{d) changes slowly on the 
scale of 1. 

In [18], Q{d) was assumed to have the form 



Q{d) 



1 



1 -I- 



(3) 



with a > 2 to allow a proper normalization according 
to Eq. ([1]). This choice was motivated by the following 
observations of human society: 

1. When two persons live close to each other, they prob- 
ably know each other. Thus we require (3(0) = 1; 

2. The greater is the distance between two persons, the 
smaller is the probability that they know each other. 
Thus Q{d) should be a decreasing function of d; 

3. We define the average number of distant people that 
every person knows as Nd = Q(r)2'!Tr dr where R 
is large and fixed. We demand Nd sufficiently high to 
reflect the observation that many people have distant 
friends (e.g. living on the opposite Earth hemisphere). 

For example, Q{d) = exp[— M] satisfies (i) and (ii) 
but if we choose R that covers half of human population 
{ttR'^ « 3-10^) and z « 200 (which is a reasonable value to 
model real acquaintances), we obtain Nd ~ 10""'^^ which is 
effectively zero. Consequently, Eq. Q represents a simple 
choice of Q{d) which for a in the range 2.5 — 3.5 complies 
with the requirements written above. Yet, we do not claim 



Let's choose one node of the network, we label it as X. 
The plane can be divided into thin concentric rings cen- 
tered at X. If the ring radius is r and its width is w, it 
covers approximately N = 2nr'w vertices. Meanwhile, all 
vertices in one ring have approximately the same distance 
from X. Therefore they also have approximately the same 
probability Q{r) :— p to be connected with X . Since links 
are drawn independently, the number of neighbours of X 
in the ring with radius r, n(r), is a random quantity with 
the binomial distribution whose mean is Np and the vari- 
ance is Np{l — p) = 2-KrwQ{r)[l — Q(r)]. 

The degree k of node X is obtained by summing n[r) 
over all rings. The central limit theorem applies here and 
thus k is normally distributed and its variance cr^ is the 
sum of variances of all contributions nir). Replacing the 
summation over all rings by the integration we obtain 
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2TrrQ{r) [l - Q{r)] dr = 2z/n. 



(5) 



To confirm this result numerically, in tab. [l]the quantity 
na'^/z is shown for various values of z and n. As can be 
seen, the numerical results are well approximated by the 
analytical prediction na'^/z = 2 for a wide range of pa- 
rameters. 

We can conclude that the node degree k has approx- 
imately the Gaussian distribution with the mean z and 
the variance 2z/n. For values of z resembling a real soci- 
ety (z of the order of hundreds) it follows that ak ^ z 
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2 = 50 


z = 150 


z = 500 


a 


= 2.5 


1.96 


1.94 


1.88 


a 


= 3.0 


1.97 


2.03 


1.93 


a 


= 3.5 


2.00 


2.05 


2.00 



Table 1. Numerical estimates of naf./ z for various values of a 
and z on the square lattice with the dimensions 1 000 x 1 000 
(4000 X 4 000 for a = 2.5), the variance cr^ is obtained from 
10 000 reahsations of the model. 



and thus the degree distribution is sharply peaked around 
its mean value (narrowness of the degree distribution is 
clearly visible in Fig. [T]). This is in a clear contradic- 
tion with the empirical studies [2T l [22 l [23 l l24] which sug- 
gest power- law behavior. The resulting social network is 
strongly homogeneous — it lacks nodes exceeding others in 
degree by orders of magnitude. In the following section we 
investigate how this basic model can be modified to pro- 
duce a heterogeneous network and exhibit a broad degree 
distribution. 



4 Heterogeneous network model 

The probability distribution Q{d) given by Eq. ([3]), funda- 
mental for this model, has two natural parameters: h and 
a. Heterogeneity can be introduced to the network by as- 
signing random values of these parameters to each node 
(with the constraints 6 > 0, a > 2). To keep the acquain- 
tance relation symmetric we symmetrize the probability 
Qij id) that persons i and j with the distance d know each 
other by the relation 



Qij{d] bi,ai, bj,aj) 



Q{d; hi, ai) + Q{d; bj,aj) 



(6) 



To simplify our calculations we assume that a is fixed in 
the network and only 5 is a random quantity drawn from 
the distribution g(b). The parameter b we call the node 
solitariness (as b grows, the number of acquaintances is 
decreasing and their average distance is getting smaller). 
The average degree of a vertex with the solitariness b is 
now 



z{b) 



Q{r;b)+Q{r;b') 



Q{r;b)'!Tr dr + 
2 



2TrrQ{b')drdb' = 

Z{b)) TT^b-^" Z 



Q!sin(27r/a) 



(7) 



Here we again replaced the summation by an integration; 
{z(b)) is the average connectivity in the network which we 
already labeled as z. As the solitariness 6 of a vertex goes 
to zero, z{b) goes to infinity. By contrast, as b increases to 
infinity, z{b) has a lower bound which is equal to z/2. 

According to our previous discussions, we would like 
to generalize the model to exhibit a wide connectivity dis- 
tribution. To achieve this, high-degree nodes with small 
values of b must be present. However, the value 6 = 
is pathological for it makes the probability distribution 
Q{d;b) flat and creates a node with an infinite degree (if 



the network itself is infinite). Thus for the distribution 
g{b) we require g{Q) = 0. The simplest possible choice is 
g{b) = Kb^ for & e (0;B], /3 > 0. Values of K and B are 
fixed by the condition {z(b)) = z and by the normalization 
of g{b), leading to 



B = 



(a + a/3 — 2)z sin(27r/Q;) 



-1 -a/2 



(l+/3)27r2 

K = (l + f3) B-'-^+'^'^ 



(8) 



Since high degrees are due to small values of b, the results 
derived below hold for all g{b) which can be approximated 
by g{b) ^ b'^ for b small. Thanks to the constraint g{Q) = 
and the Taylor expansion, this is already a quite general 
class of functions. However, in this paper we focus on the 
power-law g{b) which allows us to investigate the model 
analytically. 

First we show that the chosen form of g{b) leads to the 
desired fat distribution of connectivities. For the distribu- 
tion of z{b) := k we have 



9{k)^g{b), 



dk 
db 



2K 



^l+/3+2/a 



Consequently, using Eq. ([7]) and assuming k ^ z we ob- 
tain 

g(k) _-^-(i+"/2+"'5/2) 

We already know that when b is given, the probability dis- 
tribution of the vertex degree is sharply peaked. Therefore 
we can approximate the distribution /(fc) which we are 
searching for by the distribution g(k) of the mean degree. 
Then we obtain 



f{k) ^ fc-("/3+"+2)/2^ p(fc) ^ fc-("/3+a 



)/2 



(9) 



Here P{k) is the cumulative probability distribution of the 
vertex degree. We see, that for the chosen g(b), the degree 
distribution has a power-law tail. In Fig. [1] this analyti- 
cal result is compared with a numerical simulation of the 
model for z — 100 and a = 3. The power-law character 
of P{k) is clearly visible for k > 300 and the approximate 
values of the power-law exponents confirm Eq. ^ . 

Now we show that the modified network model still ex- 
hibits the small world phenomenon. The probability that 
two vertices with a fixed distance I have the degree of sep- 
aration D we label as P{D). We can examine this quantity 
by techniques similar to those presented in [TSj. There it 
was shown that in the resulting homogeneous network, the 
first approximation of P{D) has the form 



P{D) 



HO 



{D + l)z^Q{l). 



(10) 



The derivation of a similar result P(D)he for the het- 
erogeneous network model proposed here can be found in 
Appendix A; it is well defined only when af3 > 2. In Fig.[2l 
the resulting ratio £,{D) := P(-D)he/^'(-D)ho is shown as 
a function of /3. Notice that in the limit /3 ^ oo all ra- 
tios ^{D) approach to 1. This is because as (3 increases. 
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P{k) 




/J =0.25 
/?=0.50 
y6=0.75 
basic 



100 



300 



1000 



3000 



1.6 



1.4 



1.2 



— D=0 
-— D= 1 

D=2 

— D=3 



1.2 1.4 



1.6 



Fig. 1. The cumulative degree distribution P{k) plotted in 
a logarithmic scale. The thick lines have slopes 1.95, 2.35, and 
2.70 respectively (values predicted by Eq. ((5)) are 1.88, 2.25, 
and 2.63 respectively). The probability distributions were ob- 
tained by 20 realisations of the model on the lattice with di- 
mensions 600 X 600 for a = 3 and z = 100. For a comparison, 
the degree distribution of the basic homogeneous model is also 
shown. 



a higher weight is given to values of b close to the upper 
bound B. In particular, in the limit /? — s- cx) all nodes share 
the same value of solitariness, B. Thus we can say that the 
proposed generalization is in the limit P oo equivalent 
to the original model. One can notice that £,{D) > 1 for 
all D. This means that in the proposed heterogeneous net- 
work the probability to find a path of a certain length is 
higher than in the homogeneous network. In other words, 
hubs (nodes with a high degree) present in the heteroge- 
neous network facilitate formation of short paths. There- 
fore, for the typical degrees of separation the inequality 
-Dhe < -Dho holds. On the other hand, since the ratios 
P{D + 1)/P{D) (which are of order of z) are much larger 
then the ratios £,{D) shown in Fig.IU we can also say that 
the introduction of heterogeneity to the network does not 
change the typical degree of separation substantially and 
Due ~ Duo- 

The average clustering coefficient (C) cannot be trea- 
ted analytically and therefore in Fig. [2] we present only nu- 
merical results. They confirm the expected fact that (C) 
is little sensitive to changes of the model parameters and 
thus it is almost as high as in the original model. One can 
also notice that with increasing f3, (C) approaches to the 
value 0.161 valid for the original model (this value is taken 
from [H], in Fig.[2]it is shown as a dashed line). This limit 
behavior is similar to the limit behavior of £,{D). Since we 
observe both a small typical degree of separation and a 
high average clustering coefficient, the heterogeneous net- 
work exhibits the small world phenomenon. 



5 Conclusion 

In this paper we investigated a network model where links 
are drawn according to nodes distances. Building on the 
basic model [18 , we proposed a generalization aiming to 
introduce heterogeneity to the network and also fat tails 



0.16^" 
0.15- 
(C) 0.14- 
0.13- 
0.12- 



0.5 



1.5 



Fig. 2. Changes of the the main network properties with l3 
for z = 300 and a = 3. In the upper figure, the ratio (,{D) := 
P{D)he/P{D)ho is drawn according to Eq. ^ and Eq. (fTT|l. 
The course of (C) in the lower figure has been obtained by a 
numerical simulation of the model with averaging over 1 000 
realizations; the dashed line presents the limiting value of the 
clustering coefficient in the limit l3 ^ oo. 



to the degree distribution. First, a hidden random param- 
eter b is assigned randomly to each vertex of the network. 
Then between a pair of nodes, a link is drawn with the 
probability depending on the hidden parameter values of 
these two nodes. As a result we obtain highly heteroge- 
neous network which exhibits a power-law distribution 
P{k) over a large range of connectivities. With respect to 
the social interpretation of the model, one can say that it 
produces a social network where highly sociable party go- 
ers are present along with loners. The proportion of highly 
connected nodes can be adjusted by the distribution, from 
which the hidden parameter b is drawn — in this work we 
focused on the simple distribution g{b) — Kb^. We also 
showed that for the resulting network, the typical degree 
of separation is small and the average clustering coefficient 
is high; both are approximately equal to the corresponding 
values for the homogeneous network with same z and a. 
Thus we conclude the small world phenomenon is present 
in the networks produced by the proposed model. 

MM would like to thank the Universite de Fribourg for the 
financial support and kind hospitality and to Paulo Laureti for 
enjoyable and helpful discussions. JS would like to thank Matiis 
Medo for introduction to the field and patient and agreeable 
collaboration. 
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Y 




Fig. 3. The exact diagram for P(2) (left) and the approximate 
one utilizing the properties of Q{d) (right). 



1 



2 



X Y X Y X Y 

Fig. 4. Three diagrams contributing substantially to P{2). 

A Calculation of P{D) in the heterogeneous 
network 

To obtain an approximate expression for P{D), the treat- 
ment is similar to the treatment of the homogeneous net- 
work model in [18] . As we will see, differences and compli- 
cations arise from the additional averaging over possible 
values of the solitariness h with g{h). 

We pick two nodes with a large geometrical distance 
I, let's label them X and Y . As an illustrative example 
we examine the probability ^"(2). That is, we examine 
paths between X and Y that have two intermediate ver- 
tices (Fig. [31 left). Notice that D = 2 requires that links 
X2, lY, and XY are not present. Since probabilities of 
these links are small, to obtain a first approximation of 
P{D) we neglect that such shortcuts may occur. Conse- 
quently, the diagram for P{2) is simphfied (Fig. [31 right) 
to the existence of edges XI, 12, and 2Y. 

Another simplification comes from the form of Q{d) 
given by Eq. ([3]). It is easy to check that when I is large, 
for d < / holds Q{l~d)Q[d) > Q{l/2)Q[l/2). This means 
that among all paths X12Y, the fundamental contribution 
comes from those which contain only one long link. More- 
over, for d ^ ^ we have also Q{1 — c?) « Q{1) and therefore 
the probability of the long link can be approximated by 
Q{1)- As a result we can further simplify the right dia- 
gram in Fig. [3l to Fig. [H where the only three diagrams 
substantially contributing to P{2) are shown (three dif- 
ferent possibilities appear because there are three ways to 
choose the long link in the path X12Y; since probability 
of the long link is always approximately equal to Q{1), the 
link is drawn between X and Y). 

In the basic network model, the contribution of the left 
most diagram in Fig. [Uto P{2) is 



Pi 



Qidxi)Qidi2)Qil)drxidri2 = Q{l)z^ 



1,2 



Since the remaining two diagrams give the same result, to- 
gether we have ^(2) « 3z^Q{l) which agrees with Eq. (fTO|l . 
In the modified model of a heterogeneous network, Q{d) 
is generalized to Q{d; b) and the connection probability 
is symmetrized by [Q{d;bi) + Q{d;b2)]/2. Then for the 
left most diagram shown in Fig. [4] we encounter the com- 
plex expression [Q{dxi;bx) + Q{dxi;bi)] x [Q{di2;bi) + 
Q{di2', ^2)] X [Q{1', ^2) + Q{1', by)]- Moreover, in addition to 
the integration over rxiTri2, we also have to integrate 



over bxjby ,bi,b2- Then we encounter the following inte- 
grals 



k{b)Q{b) Ab := z, 



Q{1; b)g{b) db « Q{1; B) := K^Q{1- B) 



/ I Q{ri;b)Q(l;b)g{b)dbdri 
Jo Ji 

_ a + al3-2 
a/3-2 

/7/ Qiri;b)Q{r2;b)g{b)dbdr^dr 

Jo JJl,2 



Q{l;B) := K2zQ{l;B), 



{a + al3-2y 



a{l + P){a + aP - 4) 



The third integral converges when a/3 > 2, the fourth 
when a + aP > 4 (since a > 2, this is a weaker restriction). 

Using the steps and notation introduced above we fi- 
nally obtain the approximate result 



P{DhB^^Q{l;B)z'' 



(11) 



where 



i(0) -ifi, 

L{l)^3Ki+K2, 

L{2) = 6Ki+AK2 + 2KiK3, 

L(3) = lOKi + IOK2 + IOK1K3 + 2K2K3, 

L(4) = IBKi + 2OK2 + 3OK1K3 + I2K2K3 + SKiKl . . . 

For D > 4 we obtain even more complicated expressions. 
Nevertheless, since the typical degree of separation is small 
in the discussed model, this is not a crucial complication 
and the solution is tractable. We can notice that in the 
limit /? — > CX3 we have i^i , -R'2 , ^^^3 1 and therefore 
P{D)he ~ P{D)ho as expected. 
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